PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_010547379.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Cleomaceae; Tarenaya
Family MYB
Protein Properties Length: 1685aa    MW: 184713 Da    PI: 6.4441
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_010547379.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding33.51e-10800842246
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      ++WT+eE+e++v+ ++ +G++ +++Ia+ +  ++t  +c+++++k
   XP_010547379.1 800 NPWTKEEEEIFVEKFATYGKD-FRRIASFLD-HKTTADCIEFYYK 842
                      69*****************99.*********.***********98 PP

2Myb_DNA-binding25.82.4e-0810141055245
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                       g WT eE  l+ +a  ++G++ +++I+r++  +++++ c+ ++ 
   XP_010547379.1 1014 GDWTDEERSLFMHALSLYGKD-FASISRCIR-TKSREKCRIFFS 1055
                       78*****************99.*********.*******98775 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466897.03E-15785846IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.603.5E-7792844IPR009057Homeodomain-like
PROSITE profilePS5129317.202797848IPR017884SANT domain
SMARTSM007173.0E-12798846IPR001005SANT/Myb domain
PfamPF002491.9E-8800842IPR001005SANT/Myb domain
PROSITE profilePS5129313.07610111062IPR017884SANT domain
SMARTSM007171.2E-710121060IPR001005SANT/Myb domain
PfamPF002495.9E-710141054IPR001005SANT/Myb domain
SuperFamilySSF466891.48E-910141063IPR009057Homeodomain-like
CDDcd001671.25E-510161054No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1685 aa     Download sequence    Send to blast
MPPESLPWDR KDDFREKKHE RPESFGSNPR WRNSSAASYS PHYGSRDFPR WGSPDFRRPP  60
YSGKQGSWEQ FEETSSHGCA ASRSGGRTFG NVTYQSSVSR GDFRHCRNIR DNRGSFVQRD  120
WKPHAWVANN GATNAFGRPF DMITDQRLAE NIPPCPASQS DTMKTWDQFR DKQDNKVSGN  180
NGLNAGQKCE LETSLSSVDW KPLKWTRSGS LCSRSSGFSN SSSSKSLGAV DSSEMKDENL  240
RSSATLLQSP PGNATPCVAS AVSSDEAISR KKPRLGWGEG LAKYEKRKVD GPDVNVQKDG  300
PTLSTNSSEA APSLVSSLVD KSPRILGFTD CASPATPSSI ACSSSPSADD NLYRKAANAN  360
SEPNNMSGLP SLGVWNQLEG FSFNLENLDD VSMANLSSSL NWLLHLADPG SVDCCFASST  420
AMSRLLFWKG DVLKTLEVAE SEIDLLENEL KRIKSESGDF SLCPASSSSL AVDVNTNLSK  480
ELEAVSTFIL RPAPLQLTCS ADEVVDKTPT GCSGLEEHPA NGKAEDLDSP GTATSKLLEP  540
LSLANSLLVS KFEFFVEDSG NSIKNQSQDL LLSCNESCLA GNDDVNAPLN SENIELASGG  600
SRSDDGDDIQ CKNIMLCNKE SARQASEVLI KLLPGDFRSD NALDLSDIVG SPDSVLVRQR  660
ITARRRMMRF KEKVLAIKFK AFLNAWRKDM HQLSMRRCRP RSQKKVDSGL RMINGGYTKN  720
RTHSRSRLSS PGNLCQISFP EMVSLTSEIL SNSPQRPYRN VLKMPEIILE DKEKIFSRFI  780
STNGLIEDPC AVEKERARMN PWTKEEEEIF VEKFATYGKD FRRIASFLDH KTTADCIEFY  840
YKNHKSDCFK KIKKCGNNKQ EKCAANTYLV TSSKKWNRDT NAVSLNILGA VSAMAAHADC  900
KTASGSRFSV RITSGKRIES KMSQADDCIE RSSSFDLPEK ESFAADILAG FCGSLSSEAM  960
SSCITSCVDP GEICRDWKFQ RMDSSRKRCS VSDITQTAND DTCSDDSCGE TDSGDWTDEE  1020
RSLFMHALSL YGKDFASISR CIRTKSREKC RIFFSKARKR LGLDSILLPR NVGSSVSDDA  1080
DGDGSDSEDA CVLGTSSAIC NDVVDAKRDE DLSGSPFKIN QDACLLGPVK LETDLNMSEV  1140
EDVRGILDRS IHLDTFGVDD KHQGHHGVAN ESVNGNCVKI QAQPDQQAQE DTVLSTDAEK  1200
SMDSASQYHA SSCAVASFAP CSSLSDELPA SDIVKVTMET RKEDSVLDKS VMSITNLNED  1260
ACGKHDGRNF VQDSLVNNAN ATNNQEADTS SFSGFNNFDC QPQVSVQRRL HIPSLQEQAT  1320
ISVKLESPGH AILAALDKPG NDVSLTTNVE KAKPDHEFVG RCHHLQNVAA PHVPTSHILS  1380
DHALRISTKK EMDSDVYWGQ VREVQMNLKS DAGIGNECVA QGFSLRKCNG LINEQKVLSL  1440
EQKSGTDKSC KNGDVKLFGK ILTPHENEGK GLGLDSKQSG KPPTLNFAVQ KSADGQPANV  1500
GRDKVDYSGH ENVPIRSYGF WDGTRIQTGL SSLPDSAILL AKYPAAFANY SASSSAKEQQ  1560
VSHSLAKNYE HNLNGVSIYP ARETYGSDRV LVSHMFRNHD GNRTQPFSLG RDRVFSEVQR  1620
RNRFEAASGK GMVGMTTADP TGGVSDPVAA IKMHYVKAEQ YMAWHGNGNV GAGKEPWKRN  1680
GDIGR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C4e-17759850494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D4e-17759850494NUCLEAR RECEPTOR COREPRESSOR 2
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1270287KKPRLGWGEGLAKYEKRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010547379.10.0PREDICTED: uncharacterized protein LOC104819148 isoform X1
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM52602744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-176MYB family protein